Submission 2 - Intermediate

Instructions

  • Fill out the 'Basic Information' and 'Simple Hypotheses - Findings / Insights' sections (replace the TODOs with your information)
    • If your visualization is interactive, create a short video that shows the interactivity (max. 3 min).
  • Export the Jupyter notebook as an HTML file and save it in this repository.
    • Go to 'File' -> 'Save and Export Notebook As..' -> 'HTML'.
    • Save it locally in your project folder. You can leave the name as it is.
    • If you have a video, then add it to the project folder.
    • Push your local changes to the GitHub repository.
  • Upload the generated HTML for the Moodle submission.
    • If you have a video, then upload that for the Moodle submission, too.

Basic Information

Team Name: The Fantastic Four

Team Members:

  • Mohamed Abdelaziz
  • Ivan Drinovac
  • Hanna Messner
  • Ádám Párkányi

Dataset Description:
(max. 75 words)

Historical data on deaths by cause across different countries from 1990 to 2019, combined with the population of each country to derive more insight from the data.


Simple Hypotheses - Findings / Insights

Instructions:

  • Add your visualizations (max. 4) here
    • The visualizations should show the findings / insights
  • Add the descriptions for all your findings / insights (max. 150 words)
    • The description should also relate to the visualizations above.

Visualizations

In [1]:
import pandas as pd
import utils

df = pd.read_csv('../data/cause_of_deaths_cont_pop.csv')
In [2]:
# Hypothesis 1 (World Map):
utils.plot_hyp_1_1(df)
Correlation Coefficient of the Fatalities due to (Alcohol Use Disorders) and (Cirrhosis and Other Chronic Liver Diseases) per Country from 1990 to 2019.
In [3]:
# Hypothesis 1 (Matrix):
utils.plot_hyp_1_2(df)
Overall Correlation Matrix of the Fatalities due to (Alcohol Use Disorders) and (Cirrhosis and Other Chronic Liver Diseases) Worldwide from 1990 to 2019.
In [4]:
# Hypothesis 2 (World Map):
utils.plot_hyp_2_1(df)
Comparison Between the Absolute Values and Relative Values of Total Road Injuries Fatalities worldwide from 1990 to 2019
In [5]:
# Hypothesis 2 (World Map):
utils.plot_hyp_2_2(df)
Average Growth rate of the Road Injuries Fatalities by country from 1990 to 2019

Description

Hypothesis 1
    | There is a correlation between the number of deaths caused by chronic liver diseases and the number caused by alcohol use disorders.

     Interpretation:
Based on the correlation matrix plot, we can deduce a moderate positive correlation of 0.58 between the two variables. The world map supports and refines this finding by showing the correlation coefficient between the two variables for each country over the 30 years from 1990 to 2019, with the strongest positive correlation in the US and the strongest negative correlation in Austria.
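
As a rough illustration of how an overall and a per-country correlation can differ, the sketch below computes both with pandas on a tiny made-up table. It is not the code behind `utils.plot_hyp_1_1` or `utils.plot_hyp_1_2`. The column names (`Country`, `Alcohol Use Disorders`, `Cirrhosis and Other Chronic Liver Diseases`) and the helper `correlation_by_country` are assumptions chosen for the example, and the numbers are invented.

```python
import pandas as pd

# Assumed column names; the real CSV may use different ones.
ALCOHOL = "Alcohol Use Disorders"
LIVER = "Cirrhosis and Other Chronic Liver Diseases"


def correlation_by_country(df, col_a, col_b, group_col="Country"):
    """Pearson correlation between col_a and col_b, computed per group."""
    return pd.Series(
        {name: g[col_a].corr(g[col_b]) for name, g in df.groupby(group_col)}
    )


# Invented toy data: two countries, three years each.
toy = pd.DataFrame({
    "Country": ["A", "A", "A", "B", "B", "B"],
    ALCOHOL:   [1, 2, 3, 1, 2, 3],
    LIVER:     [2, 4, 6, 6, 4, 2],
})

per_country = correlation_by_country(toy, ALCOHOL, LIVER)
overall = toy[ALCOHOL].corr(toy[LIVER])

print(per_country)
print(overall)
```

In this toy table the two countries move in opposite directions, so the pooled coefficient hides what the per-country values reveal. This is why a per-country map is a useful complement to a single worldwide matrix.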


Hypothesis 2
    | There is a continuous rise in the number of road injury deaths recorded on a global scale.

     Interpretation:
Comparing the absolute number of road injury fatalities with the values relative to total population shows that, while the overall number of deaths appears to be increasing, deaths as a share of the population are decreasing in several continents, mainly Africa, Asia and Europe, and remain roughly flat in the rest. The second graph, which shows the average growth rate per country over the 30 years from 1990 to 2019, reflects the same pattern.
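
The difference between absolute and relative values can be sketched as follows. This is not the implementation of `utils.plot_hyp_2_1` or `utils.plot_hyp_2_2`. The column names (`Country`, `Year`, `Road Injuries`, `Population`) and both helper functions are assumptions, the data is invented, and "average growth rate" is taken here to mean the mean of the year-over-year percentage changes, which may differ from the definition used in the plots.

```python
import pandas as pd


def add_rate_per_100k(df, deaths_col, pop_col="Population"):
    """Return a copy of df with deaths expressed per 100,000 inhabitants."""
    out = df.copy()
    out[deaths_col + " per 100k"] = out[deaths_col] / out[pop_col] * 100_000
    return out


def mean_yearly_growth(df, value_col, group_col="Country", year_col="Year"):
    """Mean of the year-over-year relative changes of value_col per group."""
    ordered = df.sort_values([group_col, year_col])
    yearly_change = ordered.groupby(group_col)[value_col].pct_change()
    # The first year of each group has no previous value (NaN) and is
    # skipped by mean().
    return yearly_change.groupby(ordered[group_col]).mean()


# Invented toy data: deaths rise in country A, but its population rises faster.
toy = pd.DataFrame({
    "Country":       ["A", "A", "A", "B", "B", "B"],
    "Year":          [2000, 2001, 2002, 2000, 2001, 2002],
    "Road Injuries": [100, 110, 121, 200, 100, 50],
    "Population":    [1000, 2000, 4000, 1000, 1000, 1000],
})

with_rate = add_rate_per_100k(toy, "Road Injuries")
growth = mean_yearly_growth(toy, "Road Injuries")

print(with_rate)
print(growth)
```

In the toy data, country A's absolute death count grows every year while its rate per 100,000 inhabitants falls. This is the same kind of divergence described in the interpretation above.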
